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Abstract 

Graphs  change  over  time,  and  typically  variations  on  the  small  multiples  or  anima¬ 
tion  pattern  is  used  to  convey  this  dynamism  visually.  However,  both  of  these  classical 
techniques  have  significant  drawbacks,  so  a  new  approach,  Storyline  Visualization  of 
Events  on  a  Network  (SVEN)  is  proposed.  SVEN  builds  on  storyline  techniques,  con¬ 
veying  nodes  as  contiguous  lines  over  time.  SVEN  encodes  time  in  a  natural  manner, 
along  the  horizontal  axis,  and  optimizes  the  vertical  placement  of  storylines  to  decrease 
clutter  (line  crossings,  straightness,  and  bends)  in  the  drawing.  This  paper  demon¬ 
strates  SVEN  on  several  different  flavors  of  real-world  dynamic  data,  and  outlines  the 
remaining  near-term  future  work. 


1  Introduction 

Graphs  have  become  an  essential  mathematical  tool  to  aid  in  the  representation  and  anal¬ 
ysis  of  relationships  between  discrete  entities  such  as  people,  companies,  computers,  etc. 
Not  surprisingly,  visualization  of  graphs  (i.e.,  graph  drawing)  has  been  an  active  area  of 
research  for  the  past  several  decades.  However,  the  world  is  a  dynamic  place,  so  graphs 
that  model  objects  and  their  relationships  in  the  world  often  change  over  time.  To  account 
for  this,  one  often  chooses  a  particular  time  interval  of  interest  and  constructs  a  single 
graph  capturing  the  relationships  observed  in  that  interval.  This  is  the  de  facto  method  for 
social  network  analysis,  where  the  graph  is  constructed  from  a  series  of  interviews,  ques¬ 
tionnaires,  or  observations  occurring  within  some  time  interval  (5%j.  Visualization  of  the 
graph  often  plays  a  crucial  role  in  the  initial  understanding  of  the  phenomena  of  interest. 
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Figure  1:  Representation  of  a  sequence  of  events  on  a  network  as  a  directed  labeled  multi¬ 
graph.  Labels  on  edges  represent  the  time  of  the  event. 


There  is  no  reason  to  require  that  such  data  must  be  compressed  into  a  single  graph  aside 
from  convenience.  On  the  contrary,  one  would  expect  a  longitudinal  analysis  to  potentially 
yield  more  insight  into  dynamic  phenomena  than  a  more  restrictive  static  analysis.  A 
typical  way  to  understand  a  dynamic  graph  is  to  induce  a  sequence  of  subgraphs 

{Gi  =  (Ri,  £d),  G2  =  (V2,  E2), ...}, 

by  partitioning  a  time  interval  into  several  smaller  non-overlapping  time  windows.  Alter¬ 
natively  we  can  consider  a  sequence  of  interaction  events  on  a  continuous  timescale  as  a 
set  of  time-edge  tuples  of  the  form 

{(ii,(wi,wi)),(i2,(«2,V2)),-}, 

where  the  ith  interaction  occurs  at  time  U  and  involves  Ui  and  u*,  and  reads  as  “at  time  U 
node  u  interacted  with  node  u.”  For  simplicity  we  assume  that  interactions  are  pairwise  and 
of  negligible  duration  relative  to  the  timescale  of  the  visualization;  this  type  of  network 
has  been  referred  to  as  a  “contact  sequence”  [26],  and  an  example  is  shown  in  Fig.  [I] 
Many  relevant  real-world  phenomena  can  be  described  in  this  manner,  such  as  a  group  of 
authors  publishing  a  paper  together,  money  being  wired  between  bank  accounts,  one  or 
more  proteins  activating  or  inhibiting  another  protein,  and  dialogue  between  characters  a 
play  or  novel. 

Given  that  node-link  visualizations  of  static  graphs  are  wildly  popular,  it  is  not  surpris¬ 
ing  that  demand  for  effective  dynamic  graph  visualization  capability  is  increasing.  Perhaps 
what  is  surprising  is  that,  to  date,  no  general  purpose  framework  for  dynamic  graph  visu¬ 
alization  has  become  widely  accepted  in  the  same  way  that  force  directed  placement  has 
been  for  static  graph  visualization.  Research  in  dynamic  graph  visualization  began  with 
showing  the  changes  to  the  graph  either  as  a  sequence  of  still  images  (e.g.,  small  multiples) 
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for  each  Gi,  or  as  an  animation  that  interpolates  between  these  drawings  [35l  H] .  While 
this  approach  to  dynamic  graph  visualization  problem  inherits  the  challenges  of  optimizing 
the  aesthetic  properties  of  the  static  views  of  the  graphs,  it  also  introduces  a  new  problem 
of  how  to  change  those  views  in  a  way  that  doesn’t  confuse  or  mislead  the  viewer.  It  is 
hypothesized  that  the  change  between  consecutive  drawings  should  small  to  help  preserve 
the  user’s  “mental  map”  mm- 

However,  even  if  the  mental  map  is  well  preserved,  both  small  multiples  and  animation 
have  significant  drawbacks.  The  small  multiples  technique  must  decrease  the  size  of  the 
individual  drawings  in  order  to  fit  the  entire  collection  on  screen,  and  it  can  be  tasking 
to  trace  nodes  and  their  relationships  as  they  jump  from  one  view  to  the  next.  With 
animated  drawings,  motion  can  distract  the  user  when  that  motion  is  an  artifact  of  the 
layout  algorithm  and  not  reflective  of  an  important  change  to  the  graph  structure.  Also, 
the  user  will  be  burdened  by  seeking  back  and  forth  within  the  animation  to  understand 
the  important  changes  to  the  graph.  While  annotating  an  image  is  as  trivial  writing  on  a 
printed  version,  annotating  an  animation  requires  specialized  software,  and  the  animation 
should  be  rendered  in  a  standard  format.  Furthermore,  animated  visualizations  prevent 
multiple  viewers  from  simultaneously  viewing  different  times  within  the  animation. 

The  issues  created  by  the  “classical”  approaches  to  dynamic  network  visualization  can 
be  partially  addressed  through  effective  interactions  and  algorithms,  and  much  research  has 
been  devoted  to  such  iterative  improvements.  However,  the  visualization  community  is  still 
awaiting  broad  acceptance  of  a  “workhorse  visualization”  for  dynamic  graph  visualization. 
While  not  originally  intended  as  general  purpose  solution  for  dynamic  graph  visualization, 
storyline  visualization  HD  SOI  [sol  EU  holds  promise  for  this.  The  contribution  of  this  paper 
is  the  adaptation  of  storyline  visualization  techniques  to  dynamic  network  visualization- 
storyline  visualization  of  events  on  a  network  (SVEN)-an  effective  visualization  technique 
for  both  flavors  of  dynamic  networks  described  above. 

Consistent  with  previous  storyline  visualization  techniques,  SVEN  draws  nodes  as  con¬ 
tiguous  lines,  and  time  is  encoded  on  the  horizontal  dimension.  Nodes  that  are  interacting 
are  drawn  close  to  each  other,  and  interactions  are  drawn  as  curved  vertical  lines  that 
connect  the  interacting  nodes  at  the  appropriate  time.  SVEN  contains  an  optimization 
algorithm  to  improve  the  aesthetic  quality  of  the  drawing  by  re-arranging  and  straighten¬ 
ing  the  storylines,  and  renders  the  storyline  using  a  metro  map  styling.  What  follows  is  a 
discussion  of  relevant  literature  related  to  dynamic  graph  visualization,  a  brief  outline  of 
the  requirements  and  design  of  SVEN,  and  an  overview  SVEN’s  layout  algorithm,  followed 
by  several  examples  of  layouts  generated  by  SVEN.  Integration  of  SVEN  into  a  visual 
analytics  software  system  and  evaluation  of  SVEN  is  left  as  future  work. 
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2  Related  Work 


Dynamic  graph  visualization  has  its  origin  in  static  graph  visualization,  and  many  mod¬ 
ern  graph  drawing  algorithms  have  their  basis  in  the  original  Kamada-Kawai  [29]  and 
Fruchterman-Reingold  [22]  “force-directed”  algorithms.  These  algorithms  define  a  model 
of  a  physical  system  from  the  graph  whose  energy  can  be  measured  and  consequently 
minimized  to  produce  an  aesthetically  pleasing  drawing,  ideally.  Popular,  modern,  freely 
available  graph  drawing  packages  include  GraphViz  [T9],  Gephi  [6],  and  D3  [12,  T7], 

Though  not  as  mature  as  static  graph  visualization,  the  field  of  dynamic  graph  visual¬ 
ization  is  growing  quickly.  As  stated  previously,  the  majority  of  dynamic  graph  drawing 
approaches  use  small  multiples  m  or  animation  [351  E],  but  few  have  effective  overviews 
that  portray  all  the  changes  to  the  graph  at  once.  Among  techniques  that  do,  an  often 
repeated  technique  is  to  layer  the  2-D  graph  drawings  from  each  time  slice  into  a  third 
dimension  HUGO].  This  approach  shows  connections  between  nodes  during  a  specific  time 
slice  as  well  as  the  persistence  of  nodes  over  time  slices,  but  the  layering  creates  occlusion 
which  decreases  usability.  CiteSpace-II  analyzes  and  visualizes  trends  in  academic  research 
over  time  [16] .  One  of  its  visualizations  overlays  a  node- link  diagram  on  a  timeline,  with 
nodes  representing  events  such  as  the  publication  of  a  scientific  paper. 

Network  analysts  can  often  still  gain  insight  when  provided  only  the  ego  networks 
(the  subgraph  containing  the  ego  and  its  direct  neighbors)  of  individuals  instead  of  the 
entire  network.  Based  on  this,  Shi  et  al.  developed  a  “1. 5-dimensional”  dynamic  network 
visualization  capable  of  showing  the  ego  network  of  a  particular  individual  of  interest 
over  time  in  a  single  picture  [i8],  Burch  et  al.  adapted  the  classical  parallel  coordinates 
visualization  for  dynamic  networks  }l5j.  In  their  system,  vertices  are  drawn  on  parallel 
axes,  the  area  between  each  axis  represents  a  time  interval,  and  edges  connect  vertices  in 
adjacent  axes.  The  inevitable  problem  of  having  an  overwhelming  number  of  edge  crossings 
for  larger  datasets  is  addressed  by  reducing  the  opacity  of  the  lines  drawn. 

Recently,  there  has  been  increased  interest  in  automated  algorithms  for  “storyline  vi¬ 
sualization,”  a  technique  where  time  is  encoded  in  the  horizontal  dimension  and  characters 
are  drawn  as  contiguous  lines  HU  Sol  m  M  inspired  by  an  XKCD  web  comic  showing 
presence  of  characters  in  scenes  throughout  various  popular  films  [3BJ.  When  groups  of 
characters  interact,  their  storylines  are  closer  within  that  time  interval,  relative  to  time 
intervals  in  which  they  are  not  interacting.  However,  before  detailing  these  works  it  is 
useful  to  provide  a  more  historical  context,  as  there  are  several  older  techniques  that  are 
similar. 

2.1  Predecessors  to  Storyline  Techniques 

To  the  author’s  knowledge,  the  earliest  portrayal  of  events  on  a  network  with  a  spatial 
encoding  of  time  are  “sequence  diagrams,”  developed  in  the  1970s,  which  are  used  to 
understand  timing  and  synchronization  in  distributed  systems  m-  These  visualizations 
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were  originally  hand  drawn;  nodes  were  represented  as  vertical  parallel  lines  with  time 
moving  from  the  bottom  to  the  top  of  the  page.  Communication  events  between  processes 
were  represented  as  wavy  lines  connecting  connecting  the  processes  sending  and  receiving 
the  message  at  the  local  times  the  message  was  sent  and  received.  Sequence  diagrams  are 
effective  for  understanding  trivially  small  networks,  but  become  difficult  to  scale  up  as 
more  nodes  are  added.  The  ordering  of  nodes  in  the  diagram  can  be  chosen  to  minimize 
crossings,  but  this  is  the  Traveling  Salesman  Problem,  and  even  if  an  optimal  solution  was 
found,  there  is  no  guarantee  that  the  resulting  diagram  would  be  interpretable  for  large  or 
dense  networks. 

Visibility  representations,  which  have  existed  since  the  mid  1980s,  are  also  visually 
similar  to  storylines  |29] .  These  are  representations  of  planar  graphs  where  nodes  are  drawn 
as  horizontal  lines,  and  edges  are  drawn  as  vertical  lines  connecting  their  endpoints.  Node¬ 
edge  crossings  are  not  allowed,  so  non-planar  graphs  do  not  have  visibility  representations. 
Due  to  this  constraint,  it  is  not  apparent  how  visibility  representations  can  be  used  as  a 
general  purpose  solution  for  the  dynamic  graph  visualization  problem.  Nor  is  it  apparent 
how  to  overload  the  horizontal  axis  in  a  visibility  representation  to  also  encode  time. 

Dwyer  and  Eades  investigated  using  “columns  and  worms”  to  visualize  dynamic  net¬ 
work  induced  by  movements  of  fund  managers  over  time  [18] .  This  is  a  3-D  visualization 
where  nodes  are  represented  as  contiguous  elements  in  the  time  dimension,  and  re-arranged 
in  two  spatial  dimensions  to  optimize  force-based  graph  aesthetics.  A  disadvantage  of  this 
technique  is  that  perspective  distortion  and  occlusion  can  make  it  very  difficult  to  accu¬ 
rately  interpret  timing  and  ordering  of  events. 

Some  researchers  have  simplified  the  dynamic  graph  visualization  problem  by  increas¬ 
ing  the  level  of  abstraction  and  portraying  how  the  network  communities  change  over  time 
Bang.  Visualizing  this  information  is  inherently  simpler  than  visualizing  the  underlying 
dynamic  network.  Rosvall  and  Bergstrom  apply  a  significance  clustering  technique  to  the 
dynamic  network  at  fixed  time  windows  to  partition  the  vertices  into  groups.  Then  the 
flow  of  nodes  between  clusters  at  consecutive  time  windows  is  visualized  using  a  technique 
similar  to  Sankey  diagrams  [36]  or  parallel  sets  [9j,  which  they  call  “alluvial  diagrams.” 
However,  the  authors  make  no  attempt  to  improve  the  aesthetic  quality  of  their  visualiza¬ 
tions  by  re-arranging  nodes  to  reduce  clutter  (crossings  between  time  windows).  Instead 
nodes  are  ordered  within  each  time  window  according  to  cluster  size,  which  causes  the 
diagrams  to  scale  poorly  as  a  function  of  the  number  of  communities  and  time  windows. 

Another  disadvantage  of  Rosvall  and  Bergstrom’s  technique  is  that  significance  clus¬ 
tering  is  performed  on  time  windows  independently,  which  might  introduce  noise  (nodes 
oscillating  between  clusters  over  time),  adding  clutter  and  artifacts  to  the  visualization. 
Berger- Wolf  and  Saia  introduced  an  optimization  based  approach  for  dynamic  community 
detection  that  overcomes  this  pitfall  |T0j ,  and  this  technique  was  improved  in  [521151].  This 
framework  was  utilized  to  visualize  dynamic  communities  for  several  datasets.  Communi¬ 
ties,  which  span  time,  are  represented  as  stacked  rectangles  that  span  the  horizontal  space 
of  the  visualization.  Similar  to  storyline  visualizations,  nodes  are  represented  as  horizontal 
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lines,  placed  inside  communities,  and  can  only  exist  in  one  community  at  a  time.  When  a 
node  changes  communities  a  diagonal  line  is  used  to  represent  this  change.  Edges  in  the 
network  are  not  shown,  and  no  attempt  is  made  to  optimize  the  ordering  of  the  commu¬ 
nities  or  the  nodes  within  communities.  This  would  likely  reduce  clutter  and  improve  the 
aesthetic  properties  of  the  visualization. 

2.2  Storyline  Techniques 

Several  recent  visualizations  [30,  mmm  appear  to  have  been  directly  inspired  by  a 
series  of  hand  drawn  XKCD  web  comics  [36]  detailing  character  interactions  in  several 
popular  films.  These  visualizations  all  have  a  very  similar  visual  appearance,  and  will 
be  referred  to  as  “Storyline  Visualizations”  in  this  paper.  Kim  et  al.  use  storylines  to 
simultaneously  portray  birth,  death,  marriage,  and  divorce  within  a  genealogical  diagram 
[30].  Their  layout  method  is  effective  at  producing  an  aesthetically  pleasing  drawing,  but 
makes  structural  assumptions  about  the  genealogical  graph,  preventing  this  technique  from 
being  applied  to  the  general  dynamic  graph  drawing  problem.  Concurrently,  Ogawa  and 
Ma  developed  “software  evolution  storylines,”  intended  to  be  more  generally  applicable  to 
dynamic  graph  drawing  [41] .  which  was  later  improved  by  Tanahashi  and  Ma  [50].  Due  to 
the  challenging  multi-objective  nature  of  the  problem,  their  approach  uses  an  evolutionary 
algorithm  to  simultaneously  improve  line  wiggles,  line  crossovers,  and  whitespace  gaps. 

However,  evolutionary  algorithms  are  often  computationally  intensive,  and  when  alter¬ 
native  solutions  exist,  they  are  frequently  desirable.  For  this  reason,  Liu  et  al.  improved  on 
previous  work  by  developing  a  multi-stage  hybrid  optimization  technique  with  essentially 
the  same  problem  domain  and  aesthetic  criteria  [M] .  Their  contributions  were  algorithmic 
complexity  improvements  and  the  inclusion  of  hierarchical  constraints  on  the  interactions 
to  improve  the  correctness  of  the  drawings.  These  latter  two  storyline  visualization  ap¬ 
proaches  both  assume  the  data  can  be  described  in  terms  of  “interaction  sessions,”  which 
are  essentially  data  about  the  time,  duration,  and  participants  in  an  interaction  event. 
However,  it  is  assumed  that  an  individual  cannot  be  in  more  than  one  interaction  session 
at  any  given  time,  which  continues  to  impose  a  certain  structure  on  the  data  being  vi¬ 
sualized.  Thus,  none  of  the  storyline  visualization  techniques  to  date  can  be  considered 
solutions  for  general  purpose  dynamic  graph  visualization. 

3  Requirements  &;  Design 

Dynamic  graph  visualizations  based  on  small  multiples  or  animation  optimize  aesthetics, 
but  do  not  encode  time  spatially.  Conversely,  many  of  the  dynamic  graph  visualization 
techniques  (besides  storyline  visualizations)  that  do  encode  time  spatially  do  not  effectively 
optimize  aesthetics.  Therefore,  we  propose  two  fundamental  requirements  for  effective 
dynamic  graph  visualization: 
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•  Use  space  to  represent  time  (Rl),  and 

•  Reduce  clutter  (R2)  by  minimizing  the  following 

—  node  and  edge  crossings  (R2a) 

—  node  bends  (R2b) 

—  edge  length  (R2c) 

We  make  a  spatial  encoding  of  time  the  first  and  most  important  requirement  for 
effective  dynamic  graph  visualization.  Encoding  time  spatially  has  one  of  the  longest 
lasting  precedents  in  the  entire  held  of  information  visualization-the  technique  has  been 
used  for  hundreds  of  years  and  many  historical  hand  drawn  examples  exist  [2  •  The  second 
requirement  inherits  directly  from  the  established  visual  aesthetics  from  classical  node-link 
drawing  techniques,  and  states  that  effective  dynamic  graph  visualization  should  contain 
an  algorithm  that  rearranges  visual  elements  to  decrease  line  crossings,  line  bends,  and  line 
length.  Below  we  describe  how  SVEN  is  designed  with  these  requirements  in  mind. 

Javed  and  Elmqvist  compiled  a  useful  set  of  common  design  patterns  for  composite 
visualizations  [28l  that  we  find  helpful-they  allow  us  to  describe  how  SVEN  is  constructed 
from  simple  visualization  patterns.  In  the  language  of  their  framework,  SVEN  consists  of 
juxtaposed  arc  diagrams  (one  per  time  window)  with  integrated  nodes.  Arc  diagrams  are  a 
well  known  graph  drawing  technique  where  nodes  are  ordered  and  drawn  in  one  dimension, 
and  links  are  drawn  as  arcs  of  varying  radius  depending  on  the  distance  between  nodes  [23- 
Juxtaposition  describes  the  placement  of  two  or  more  views  side-by-side,  and  integration 
is  the  practice  of  drawing  lines  to  connect  related  objects  in  separate  views.  In  our  case, 
when  one  node  is  present  in  two  adjacent  (juxtaposed)  arc  diagrams,  they  are  connected 
by  a  line  segment.  When  detailed  time  information  about  events  within  a  time  window  is 
known,  the  arcs  within  the  arc  diagram  are  shifted  to  the  left  or  right  accordingly. 

An  example  of  constructing  a  SVEN  drawing  by  juxtaposing  arc  diagrams  is  shown  in 
Fig.  [2}  For  clarity,  the  four  nodes  are  assigned  the  same  four  levels  in  each  arc  diagram. 
However,  this  is  not  always  the  case,  which  may  cause  the  integrating  lines  to  cross,  or  to 
be  drawn  diagonally  (instead  of  horizontally).  In  SVEN,  the  integrating  lines  are  always 
horizontal  within  a  time  window;  crossing  and  diagonal  lines  occur  only  between  time 
windows.  We  refer  to  the  crossing  of  integration  lines  between  time  windows  as  “node¬ 
node  crossings.”  Additionally,  edges  (arcs  in  the  time  windows)  may  be  crossed  by  nodes 
(lines  integrating  adjacent  time  windows),  and  we  refer  to  this  as  “node-edge  crossings.” 
A  third  type  of  crossing  occurs  between  edges  within  each  time  widow  are  “edge-edge 
crossings,”  but  at  the  present  time  SVEN  does  not  directly  minimize  edge-edge  crossings 
(Fig.  |  contains  several  edge-edge  crossings).  Fig.  [3] shows  a  idealized  SVEN  drawing  that 
contains  one  node-node  and  one  node-edge  crossing. 

The  ordering,  juxtaposition,  and  integration  of  arc  diagrams  for  each  time  window  di¬ 
rectly  supports  Rl,  by  providing  a  spatial  representation  of  time  in  the  composite  visual¬ 
ization.  Furthermore,  the  use  of  arc  diagrams  (instead  of  node-link  diagrams,  for  example) 
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Figure  2:  (a)  A  single  arc  diagram  (b)  Juxtaposed  arc  diagrams  (c)  Integrated  &  juxtaposed 
arc  diagrams  (SVEN) 


Figure  3:  An  simple  SVEN  rendering  with  two  time  windows  that  contains  one  node-node 
crossing  (a)  and  one  node-edge  crossing  (b). 

makes  more  effective  use  of  screen  space  by  constructing  a  two-dimensional  drawing  out  of 
a  sequence  of  one  dimensional  drawings.  The  ordering  of  each  node  within  each  time  win¬ 
dow  can  be  adjusted  to  minimize  node-node  and  node-edge  crossings,  supporting  R2a  and 
R2c.  The  spacing  between  nodes  within  each  time  window  can  be  adjusted  to  straighten 
the  integrating  lines,  supporting  R2b. 

Past  storyline  visualization  techniques  '30.  iJd)  150,  ,34]  require  that  storylines  move  closer 
while  the  participants  are  interacting  and  then  move  apart  afterwards.  We  do  not  make 
this  an  explicit  requirement  in  SVEN  because  interaction  events  are  visually  mapped  onto 
arcs  in  the  diagram,  and  for  very  complex  interactions  it  may  impossible  or  ambiguous 
to  encode  interaction  as  proximity.  Rather,  proximity  is  more  of  a  secondary  visual  cue 
in  SVEN,  and  is  minimized  indirectly  during  the  determination  of  node  order  within  each 
time  window.  We  also  note  that  many  two-dimensional  static  graph  drawing  algorithms 
make  the  same  tradeoff  via  force  directed  energy  minimization  schemes.  Few  algorithms 
explicitly  explicitly  solve  the  layout  based  on  the  constraint  that  the  nearest  neighbors  of 
a  node  in  the  graph  drawing  are  also  its  neighbors  in  the  graph,  with  Ref.  m  being  a 
rare  exception.  Finally,  we  note  that  omitting  this  requirement  simplifies  SVEN’s  layout 
algorithm,  which  we  describe  in  detail  in  the  following  section. 
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4  Layout  Algorithm 

SVEN  has  five  basic  steps  to  transform  raw  data  into  a  dynamic  graph  drawing: 

1.  Ingest:  Raw  data  is  transformed  into  an  graph  sequence  or  event  sequence  parti¬ 
tioned  into  time  windows,  and  from  this  an  aggregate  graph  is  constructed. 

2.  Order:  The  ordering  of  nodes  within  each  time  window  is  chosen  to  minimize  line 
crossings  and  length. 

3.  Align:  As  many  nodes  as  possible  are  chosen  for  alignment  to  minimize  line  bends. 

4.  Place:  Spacing  between  nodes  is  chosen  to  assign  positions  based  on  the  ordering 
and  alignment,  and  to  further  minimize  edge  length. 

5.  Render:  The  nodes  and  links  are  drawn,  colored,  labeled,  and  interpolated  as  ap¬ 
propriate. 

All  optimization  of  aesthetics  (R2)  occurs  during  the  ordering,  alignment,  and  place¬ 
ment  phase,  which  is  similar  to  the  multi-stage  approach  taken  by  Liu  et  al.  [34J.  They 
first  order  and  align  the  storylines  using  discrete  optimization  techniques,  and  then  deter¬ 
mine  the  placement  of  the  storylines  by  adjusting  the  space  between  the  storylines  using 
a  continuous  optimization  strategy  (i.e. ,  quadratic  programming).  Our  overall  approach 
is  similar,  but  the  techniques  we  use  within  these  stages  differ  significantly,  and  where 
appropriate  we  highlight  these  differences.  Below  we  describe  our  five  steps  in  detail. 

4.1  Ingestion 

The  data  ingestion  phase  is  responsible  for  taking  the  raw  data  from  the  user  and  trans¬ 
forming  it  into  a  sequence  of  events  or  induced  graphs.  This  phase  is  application  specific; 
interested  readers  can  find  a  comprehensive  discussion  of  dynamic  networks  and  their  ap¬ 
plications  in  Ref  |26| .  Below  we  describe  some  of  the  preprocessing  that  occurs  immediately 
after  ingestion. 


4.1.1  Aggregate  Graph 


Given  a  sequence  of  graphs  induced  from  time  windows,  an  aggregate  graph  Gagg  = 
(Vagg,  Eagg)  is  constructed.  If  an  edge  (u,v)  £  £)  then  (i,v))  £  Eagg  with  the  same 

weight  as  the  original  edge.  Additionally,  if  u  £  V)  n  Vj+i  then  ( i  +  l,u))  £  Eagg. 

The  weight  of  this  edge  can  be  specified  by  the  user,  and  larger  values  will  make  the  cor¬ 
responding  node  more  likely  to  be  straightened  across  time  windows.  An  example  of  an 


aggregate  graph  is  shown  in  Fig.  4.1.2 
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Figure  4:  The  aggregate  graph  for  the  first  three  acts  of  Macbeth.  Solid  lines  indicate  edges 
within  the  original  graphs,  and  dashed  lines  indicated  edges  between  the  adjacent  graphs. 


4.1.2  Continuation  &  Discretization 

Optionally,  a  more  classical  view  of  the  storyline  can  be  accomplished  via  continuation  of 
the  storylines.  This  is  done  by  calculating  vmin  and  vmax ,  the  first  and  last  appearances  of 
a  given  node  v.  For  a  given  time  window  i,  if  vmin  <  i  <  vmax,  then  v  is  added  to  Vi  if  it 
is  not  already  present.  The  inverse  of  this  operation  is  discretization,  where  v  is  removed 
from  Vi  if  its  degree  in  Gi  is  0.  When  necessary,  these  operations  are  performed  prior  to 
construction  of  the  aggregate  graph. 

4.2  Ordering 

The  purpose  of  the  ordering  phase  is  to  determine  an  ordering  of  the  nodes  within  each  time 
window  in  a  manner  that  supports  R2.  A  good  ordering  can  greatly  reduce  crossings,  of 
which  there  are  two  types:  node- node  and  node-edge.  Node- node  crossings  occur  between 
time  windows  when  two  nodes  exist  in  adjacent  scenes  and  their  relative  ordering  is  flipped 
going  from  one  scene  to  the  next.  Node-edge  crossings  occur  within  a  time  window  and  are 
a  result  of  connected  nodes  not  being  placed  at  adjacent  levels  in  the  graph.  An  example 
of  the  two  types  of  crossings  is  shown  in  Fig.  [3j 

The  problem  of  minimizing  node-edge  crossings  and  node-node  crossings  exists  in  a 
tradeoff  space.  If  each  node  was  assigned  a  unique  level,  then  there  would  be  no  node-node 
crossings,  but  this  would  most  likely  produce  a  drawing  with  a  large  number  of  node¬ 
edge  crossings  and  poor  aesthetics.  Similarly,  if  the  node  edge  crossings  were  somehow 
minimized  for  each  time  window  independently,  then  we  would  likely  see  many  node-node 
crossings.  Therefore,  because  these  two  aesthetic  properties  are  in  competition  with  each 
other,  it  is  the  job  of  the  ordering  algorithm  to  balance  the  two,  and  provide  a  means  for 
the  user  to  determine  a  weighting  on  their  relative  importance.  We  approach  this  problem 
by  performing  matrix  seriation  j33]  on  the  aggregate  graph,  which  induces  an  ordering  of 
the  nodes  within  each  time  window.  There  are  several  approaches  to  the  matrix  seriation 
problem,  and  we  consider  two  of  the  most  popular:  spectral  seriation  and  dendrogram 
seriation. 
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Our  spectral  seriation  approach  is  equivalent  to  a  one  dimensional  “spectral  layout”  of 
the  weighted  graph  [23],  and  is  also  very  similar  to  the  “Laplacian  eigenmap”  embedding 
and  dimension  reduction  technique  [7J.  Essentially,  the  eigenvector  corresponding  to  the 
smallest  nonzero  eigenvalue  of  the  graph  laplacian  matrix  is  found.  The  nodes  are  then 
sorted  according  to  this  eigenvector  to  produce  the  ordering.  This  technique  results  in  a 
very  good  approximation  of  the  globally  optimal  solution,  and  scalable  sparse  implemen¬ 
tations  are  readily  available. 

Liu  et  al.  illustrated  the  usefulness  of  placing  hierarchical  constraints  on  the  ordering  of 
storylines  in  order  to  maintain  correctness  of  the  drawing.  For  this  reason  we  consider  an 
alternative  ordering  method,  dendrogram  seriation,  which  is  most  commonly  known  for  its 
use  in  the  “cluster  heat  map”  [56] .  Dendrogram  seriation  addresses  the  problem  of  ordering 
the  leaves  in  a  tree  to  place  similar  leaves  (according  to  some  arbitrary  metric)  near  to 
each  other.  This  ordering  must  be  consistent  with  the  tree  structure,  and  only  2n~1  of  the 
n\  possible  permutations  of  the  leaves  meet  this  requirement.  There  are  several  algorithms 
which  can  solve  this  problem  optimally  in  polynomial  time  mm-  Then  there  is  the  matter 
of  finding  the  tree  in  the  first  place.  The  tree  may  be  determined  fully  or  partially  from  the 
hierarchical  constraints  placed  on  the  nodes  by  the  user.  However,  it  may  also  be  useful 
to  generate  the  tree  using  a  community  detection  algorithm,  and  many  algorithms  exist 
for  this  problem  [391  ET].  Many  of  these  algorithms  are  designed  to  find  partitions  of  the 
network  with  high  modularity-where  clusters  of  nodes  have  many  intra-cluster  edges  and 
few  inter-cluster  edges.  This  property  is  desirable  to  help  minimize  edge  crossing  (R2a) 
and  length  (R2c),  as  these  nodes  will  be  placed  close  to  each  other. 

4.3  Alignment 

The  purpose  of  the  alignment  stage  is  to  improve  the  readability  of  the  drawing  by  straight¬ 
ening  nodes,  or  removing  “wiggles.”  The  drawing  has  a  wiggle  whenever  a  node  is  placed 
at  a  different  height  in  two  adjacent  time  windows.  In  Lin  et  al.’s  alignment  phase,  wiggles 
are  minimized  by  solving  the  longest  common  subsequence  (LCS)  problem  once  per  time 
window,  assuming  the  node  order  in  the  previous  and  current  time  window  are  the  se¬ 
quences.  While  apparently  effective  for  the  datasets  presented  in  the  paper,  we  believe  this 
is  not  the  best  approach  for  the  problem.  Fig.  [5]  demonstrates  a  typical  case  where  LCS 
alignment  performs  poorly.  In  general  the  LCS  technique  will  not  scale  effectively,  as  only 
one  (potentially  small)  subsequence  gets  aligned,  leaving  the  remaining  nodes  unaligned. 

Instead,  we  propose  solving  this  problem  by  framing  the  alignment  problem  as  a  max¬ 
imum  weighted  independent  set  problem  (MWIS).  To  do  so  we  must  first  understand  that 
two  nodes  cannot  be  simultaneously  aligned  if  those  two  nodes  cross  (e.g.,  nodes  0  and  1  in 
Fig.§,  because  doing  so  would  change  their  ordering.  Therefore,  we  can  find  all  pairs  of 
nodes  that  cross  and  construct  a  constraint  graph  Gc.  For  a  pair  of  adjacent  time  windows, 
the  nodes  of  the  constraint  graph  are  those  nodes  that  are  present  in  both  time  windows 
(i.e.,  nodes  that  will  be  connected  with  integrating  lines).  Two  nodes  are  connected  in 


11 


Distribution  A:  Approved  for  public  release.  88ABW  Cleared  10/31/2014;  88ABW-2014-5108. 


LCS 


MWIS/Optimal 


Figure  5:  Longest  common  subsequence  alignment  vs  maximum  weighted  independent  set 
(optimal).  For  this  example,  the  longest  common  subsequence  has  length  1,  and  only  one 
node  will  be  aligned  regardless  of  which  subsequence  is  chosen.  A  greedy  MWIS  algorithm 
will  find  two  nodes  to  be  aligned,  which  is  optimal  for  this  example. 


the  constraint  graph  if  their  integrating  lines  cross,  which  can  be  found  directly  from  the 
ordering  of  the  nodes.  A  set  of  feasible  nodes  to  align  is  a  subgraph  of  Gc  that  has  no  edges. 
Finding  the  largest  such  subgraph  will  align  the  most  nodes,  which  is  precisely  the  maximal 
independent  set  problem.  If  the  nodes  are  weighted  according  to  importance  by  the  user 
(e.g.,  how  important  it  is  for  a  particular  node  to  be  aligned),  the  problem  becomes  the 
MWIS  problem  where  the  objective  is  to  find  a  subgraph  of  Gc  that  contains  no  edges  and 
whose  node  weight  sum  is  the  largest.  While  being  an  NP-hard  problem  and  the  subject  of 
much  research,  several  simple  greedy  heuristics  suffice  as  a  fast  approximations  with  good 
provable  bounds  [35], 

4.4  Placement 

In  the  placement  phase  the  vertical  positions  of  each  node  are  determined,  subject  to  the 
ordering  and  alignment  constraints  found  in  the  previous  stages  of  the  layout  algorithm. 
In  this  stage,  the  empty  space  between  nodes  should  be  effectively  used  to  shorten  edge 
lengths  as  much  as  possible,  supporting  R2c.  Similarly,  Liu  et  al.  discuss  a  “compaction” 
phase,  whose  objective  is  to  minimize  wiggles  and  unnecessary  whitespace.  This  is  the 
continuous  portion  of  their  hybrid  layout  algorithm,  and  they  rely  on  a  quadratic  program 
solver  to  perform  the  optimization.  Instead,  we  are  able  to  phrase  our  placement  problem 
in  a  graphical,  rather  than  numerical  framework,  allowing  the  optimal  result  to  be  quickly 
found.  Given  the  ordering  and  alignment  constraints  we  first  construct  a  weighted  directed 
acyclic  graph  (DAG).  Then,  we  use  the  network  simplex  algorithm  |23l  to  compute  an 
optimal  level  assignment  for  the  DAG.  The  levels  found  are  used  directly  as  the  heights  of 
the  nodes  in  the  final  SVEN  layout.  We  found  the  naive  0(\  V\2  •  |E|)  version  of  the  network 
simplex  algorithm  was  straightforward  to  implement  and  ran  quick  enough  to  be  effective 
for  the  datasets  we  examined.  Next  we  provide  some  more  details  on  how  to  construct  the 
DAG  described  above. 

The  first  step  is  to  determine  the  groups  of  nodes  that  are  being  aligned.  This  can 
be  accomplished  by  constructing  a  graph  containing  an  edge  ((i,v),(i  +  l,w))  if  node  v 
was  aligned  between  time  window  i  and  i  +  1.  We  denote  C(i,v )  to  be  the  connected 
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Figure  6:  Illustration  of  how  to  construct  a  weighted  DAG  representing  the  ordering  and 
alignment  constraints,  (a)  The  connected  component  classes  of  the  aligned  nodes  are 
connected  with  directed  edges  when  adjacent.  The  small  circles  represent  nodes  within 
each  time  window,  and  the  larger  rounded  boxes  represent  the  connected  component  classes 
resulting  from  the  alignment,  (b)  Each  edge  in  Ej  from  time  window  i  contributes  its  weight 
to  each  edge  in  the  DAG  it  spans. 


component  class  of  node  v  in  time  window  i  and  s(i,j)  as  the  jth  node  in  time  window 
i.  The  DAG,  is  constructed  by  adding  edges  between  connected  component  classes  that 
enforce  the  ordering  previously  found  for  each  time  window,  and  these  edges  have  the 
form  (C(i,s(i,j)),C(i,s(i,j  +  1))).  An  edge  (u,  v)  in  Gi  contributes  a  weight  of  w(u,v ) 
to  all  edges  in  the  DAG  of  the  form  {C (i,  s(i,  j)) ,  C(i,  s(i,  j  +  1)))  where  s(i,u )  <  s(i,j )  < 
s(i,v).  For  clarity,  Fig.  [6]  illustrates  how  to  construct  the  DAG  containing  the  ordering 
and  alignment  constraints. 

4.5  Rendering 

We  render  the  storylines  using  a  metro  map  design,  wherever  possible  drawing  straight 
lines  with  smooth  bends  and  nameable  colors.  When  a  node  appears  in  consecutive  time 
windows,  their  storyline  is  interpolated  between  the  time  windows.  A  node’s  storyline 
disappears  when  that  node  is  not  present  in  the  following  time  window.  The  ends  of  the 
storylines  are  capped  with  a  arrows  if  node’s  storyline  reappears  later  (or  previously), 
otherwise  they  are  capped  with  circles.  There  are  likely  to  be  more  nodes  than  nameable 
colors,  so  an  application  specific  means  for  assigning  colors  is  needed.  By  default,  we  assign 
nameable  colors  to  the  most  frequently  occurring  nodes  in  the  graph,  and  a  light  gray  color 
to  the  remaining  nodes. 
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5  Results 


Here  we  briefly  show  several  examples  of  layouts  generated  using  SVEN  from  a  variety  of 
datasets. 

Fig-  0  demonstrates  the  utility  of  SVEN’s  alignment  and  placement  stages.  It  shows 
changes  in  the  weekly  NCAA  Football  rankings  from  the  USA  Today  Pol0  throughout  the 
2013  season.  Only  the  top  10  teams  from  each  week  are  shown.  Fig.  |7(a)|  shows  a  direct 
mapping  of  the  changing  rankings  using  SVEN’s  metro  map  rendering  style  without  any 
optimization  (i.e.,  there  is  1  unit  of  space  between  each  team).  Fig.  |7(b)[  demonstrates 
the  use  of  SVEN’s  alignment  and  placement  stages,  while  maintaining  the  ordering  from 
the  ranking.  We  noticed  that  in  this  dataset,  generally  rank  increases  in  the  top  ranked 
teams  are  small  whereas  rank  decreases  are  large.  A  rank  increase  for  some  team  A  will 
generally  occur  when  another  higher  ranked  team  B  loses  a  game,  causing  B  to  drop  below 
A.  Therefore,  in  MWIS  algorithm  nodes  are  weighted  to  prefer  aligning  nodes  with  rank 
increases  over  time  (i.e.,  weights  provided  to  the  MWIS  algorithm  are  1  unless  a  team’s 
rank  decreases,  in  which  case  the  weight  is  0),  which  is  shown  in  Fig.  |7(c)  The  effect  of 
this  weighting  is  a  representation  of  the  data  that  highlights  sudden  decreases  in  a  team’s 


rank.  We  note  that  the  representation  in  Fig.  7(c)  is  not  a  linear  encoding  of  the  ranks 


which  makes  reading  the  true  rank  of  the  team  directly  from  the  diagram  more  difficult, 
but  relative  rankings  are  more  readily  apparent. 

Fig.  [8]  is  a  representation  of  the  dynamics  occurring  during  cell  division  for  a  simplified 
model  of  budding  yeast  [32].  During  this  process,  various  proteins  interact  by  activating  or 
inhibiting  other  proteins,  or  naturally  decay  via  degradation.  This  carefully  choreographed 
sequence  of  events  is  responsible  for  regulating  each  phase  of  the  cell  division.  The  process 
can  involve  hundreds  of  different  proteins,  but  the  dynamical  model  developed  by  Li  et 
al.  [32]  greatly  summarizes  the  dynamics  in  terms  of  just  a  handful  of  key  players.  The 
result  of  their  analysis  is  a  discrete-time  discrete-state  description  of  the  cell-cycle  dynam¬ 
ics.  Combined  with  the  protein  interaction  network,  we  can  deduce  which  proteins  were 
responsible  for  activation  and  inhibition  between  each  discrete  time  step,  and  we  represent 
these  as  (t,  (u,v))  events,  to  be  read  as  “at  time  t  protein  u  contributed  to  the  activation 
or  inhibition  of  protein  v.”  Note  that  more  than  one  protein  can  contribute  (i.e.,  v  can 
have  an  in-degree  that  is  greater  than  1).  We  also  ensure  that  if  protein  u  is  denoted  as 
active  by  the  dynamical  system  model  at  time  step  i,  then  node  u  is  present  in  Gi  in  the 
visualization.  Additionally,  because  the  dynamics  represent  a  cycle,  where  the  final  state 
and  starting  states  are  consistent,  we  ensure  that  this  is  accounted  for  in  the  ordering,  and 
alignment  stages  in  SVEN.  The  effect  of  this  is  seen  in  Fig.  [8]  where  Sicl  and  Cdhl  are 
present  at  the  same  level  within  the  first  and  last  time  window. 

The  “Newcomb  Fraternity”  dataset  is  a  classical  longitudinal  study  of  social  dynam¬ 
ics  mmm-  It  contains  fifteen  17  x  17  subjective  ranking  matrices  that  represent 


1source:  ESPN  http://espn.go.com/college-football/rankings 
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(b) 


(c) 


Figure  7:  Top  10  NCAAF  teams,  ranked  by  the  USA  Today  poll,  throughout  the  2013 
season,  (a)  Direct  mapping  of  the  rankings  onto  the  vertical  axis  (b)  Rankings  with  ad¬ 
justed  spacing  using  SVEN’s  alignment  and  placement  stages,  (c)  Weighting  in  the  MWIS 
algorithm  chosen  to  straighten  rank  increases. 
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Figure  8:  Budding  yeast  cell  cycle  modeled  by  |32j.  Edges  represent  activation  or  inhibition, 
which  occurs  between  discrete  time  steps  in  the  model. 


seventeen  individuals’  rankings  of  the  other  16  fraternity  members,  where  a  low  ranking 
indicates  that  member  views  the  other  member  as  a  close  friend.  The  fifteen  versions  of 
the  ranking  matrices  represent  measurement  of  these  rankings  over  time.  This  dataset 
is  transformed  into  a  sequence  of  graphs  as  follows.  Let  rb  be  the  subjective  ranking  of 
individual  j  by  individual  i  at  time  t.  Then  Et  contains  the  edge  (i,j)  if  and  only  if  r*| -  <  e 
and  r'L  <  e,  where  e  is  an  arbitrarily  chosen  threshold.  This  results  in  an  induced  graph  for 
each  time  window  that  represents  close  and  reciprocated  friendships  (we  let  e  =  2,  as  this  is 
the  smallest  e  that  allows  for  non-trivial  structures).  Fig.  [9] shows  the  SVEN  visualization 
of  this  dataset. 


Figure  9:  The  Newcomb  Fraternity  longitudinal  study  mmm-  The  sequence  of  graphs 
is  induced  by  considering  close  reciprocated  friendships. 


Dynamic  network  visualizations  can  be  useful  to  understand  the  evolution  of  discourse 
(i.e.,  a  sequence  of  speakers)  to  understand  who  talks  to  whom,  and  how  these  relation¬ 
ships  change  over  time.  To  process  dialogue  into  a  dynamic  network,  first  a  parser  must 
extract  the  sequence  of  speakers  {si,S2,...}  from  the  raw  text  data.  Then,  this  sequence 
is  transformed  into  a  set  of  network  events  of  the  form  (t,  (sj,Sj+i))  where  t  is  the  line 
number  and  s*  and  Sj+i  are  consecutive  speakers.  The  event  sequence  is  then  partitioned 
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into  time  windows;  a  good  discretization  of  time  would  be  to  follow  the  act  or  scene  breaks 
(in  the  case  of  a  play)  or  chapters  (in  the  case  of  a  book),  as  these  are  the  natural  temporal 
breaks  decided  upon  by  the  author.  Fig.  [lO]  shows  SVEN  visualizations  of  the  dialogue  in 
two  well  known  Shakespeare  play^} 


Figure  10:  Dialogue  from  Shakespeare  plays,  (a)  Midsummer  Night's  Dream  (b)  Macbeth 


6  Conclusions  &;  Future  Work 

We  have  presented  a  preliminary  algorithm  for  adapting  storyline  visualization  techniques 
to  the  problem  of  dynamic  graph  visualization.  Our  solution  is  derived  from  two  basic 
requirements:  representing  time  using  the  horizontal  axis  (Rl)  and  reducing  clutter  by 
rearranging  the  storylines  along  the  vertical  axis  (R2).  We  approach  this  challenging 
problem  by  breaking  the  optimization  into  multiple  stages  (ordering,  alignment,  and  place¬ 
ment)  similar  to  previous  work  [3JJ.  However,  we  differ  significantly  from  previous  work 
in  how  we  accomplish  these  sub-tasks.  Ordering  of  nodes  is  quickly  and  effectively  accom¬ 
plished  by  seriation  of  the  aggregate  graph  matrix  using  spectral  methods  (alternatively, 
hierarchical  graph  clustering  and  dendrogram  seriation  can  be  used).  Alignment  of  the 
nodes  is  performed  by  determining  node  crossings  between  adjacent  time  windows,  and 
solving  the  maximum  weighted  independent  set  (MWIS)  problem  to  maximize  the  number 
of  straightened  nodes.  Then  the  placement  of  the  nodes  is  optimized  using  the  network 
simplex  algorithm. 

2source:  Project  Gutenbergwww.gutenberg.org 
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We  have  applied  SVEN  to  a  variety  of  datasets.  We  showed  how  SVEN  can  be  used  to 
optimize  the  placement  of  rankings  over  time  and  that  the  weighting  of  nodes  for  the  MWIS 
algorithm  can  have  a  beneficial  effect  on  the  readability  of  the  diagram  when  properly 
chosen.  We  utilized  SVEN  to  visualize  the  complex  dynamics  produced  by  a  model  of  the 
budding  yeast  cell  cycle,  clarifying  the  sequence  of  activation,  inhibition,  and  degradation 
that  regulates  the  process.  SVEN  was  used  to  visualize  a  classical  longitudinal  study  of 
social  acquaintances  in  a  fraternity,  highlighting  close  friendships  and  changing  groups. 
Finally,  SVEN  was  used  to  show  the  sequence  of  speakers  in  two  well  known  Shakespeare 
plays.  Much  future  work  remains,  and  this  work  falls  into  three  categories:  implementing 
interactivity,  performing  evaluation,  and  addressing  issues  pertaining  to  scalability. 

6.1  Interactivity 

Interactivity  is  crucial  to  accommodate  some  of  the  shortcomings  inherent  to  SVEN  visu¬ 
alizations  of  dynamic  networks.  Many  potentially  useful  interactions  fall  into  the  category 
of  “details  on  demand,”  where  additional  information  can  be  given  to  the  user  when  they 
request  it.  For  example,  when  a  node  is  absent  from  a  time  window,  but  present  before 
and  after,  that  node’s  storyline  will  be  broken  making  the  line  harder  to  trace  through  the 
visualization.  A  solution  could  be  to  complete  the  storyline  for  a  given  node  at  the  user’s 
request.  Another  useful  way  to  provide  details  is  to  allow  the  user  to  brush  across  the 
visualization,  selecting  a  subset  of  the  nodes  and  time  simultaneously.  Then  an  alternative 
visualization,  such  as  a  node  link  diagram  or  adjacency  matrix,  can  be  shown  to  the  user 
for  this  particular  subset  where  it  may  be  more  informative  than  the  overview  provided  by 
SVEN. 

Temporal  networks  can  have  a  large  number  of  edges  which  can  cause  clutter.  A  so¬ 
lution  would  be  to  use  transparency  or  other  techniques  to  de-emphasize  these  edges,  but 
then  user  may  want  to  re-emphasize  all  the  edges  incident  to  a  particular  node  over  time 
or  at  a  particular  time  upon  request.  Other  interactions  could  involve  supporting  various 
analyses  relevant  to  temporal  networks,  such  as  temporal  shortest  path.  For  example,  the 
user  could  select  a  node  at  a  particular  time  and  request  that  the  visualization  shows  when 
other  nodes  can  be  reached,  the  minimum  number  of  steps  this  requires,  and  the  particu¬ 
lar  optimal  path(s)  to  accomplish  this.  Additional  application  specific  details  should  also 
be  presented  directly  in  the  visualization,  or  made  available  through  interaction.  Other 
standard  interactions  such  as  adding/deleting  nodes/links  and  re-arranging  the  visualiza¬ 
tion  would  be  important  features  of  a  fully  functioning  visual  analytics  tool  for  dynamic 
network  visualization. 

6.2  Evaluation 

Evaluation  of  SVEN  is  an  important  future  research  activity-many  unanswered  questions 
remain.  Crucially,  we  would  like  empirical  evidence  about  what  combinations  of  data  and 
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task  allow  users  of  SVEN  to  attain  good  performance  relative  to  existing  techniques.  This 
can  be  accomplished  with  a  controlled  user  study  with  simple  tasks  that  have  measurable 
performance.  In  this  case  it  may  be  important  to  generate  the  datasets  from  a  parametrized 
random  model  of  a  temporal  networks,  rather  than  using  real-world  datasets.  To  get  a 
better  understanding  of  how  SVEN  could  provide  insight  to  an  analyst,  the  opposite  is 
the  case-datasets  should  be  taken  from  the  real  world  with  a  particular  application  and 
challenging  question  in  mind.  Evaluation  by  the  “analysts”  would  be  more  subjective 
compared  to  the  controlled  user  study,  but  potentially  more  informative.  Additionally, 
objective  evaluation  of  the  layout  algorithm  against  previous  work,  different  node/link 
weightings,  or  different  heuristics  can  be  accomplished  by  measuring  the  aesthetic  qualities 
of  the  resulting  layouts  (e.g.,  crossings,  wiggles,  line  length). 

6.3  Scalability 

From  a  computational  standpoint,  SVEN  is  scalable  to  large  dataset^j  The  ordering 
stage’s  time  complexity,  when  using  the  spectral  layout  method,  which  requires  computing 
the  eigenvectors  of  a  matrix,  is  0(|V|3)  or  better  depending  on  sparsity.  The  alignment 
stage  requires  determining  the  set  of  nodes  that  cross  between  adjacent  time  windows. 
Our  naive  implementation  simply  compares  all  pairs  of  nodes,  which  has  a  time  complexity 
of  0(|  V|2);  a  more  sophisticated  algorithm  using  bisection  would  have  a  time  complexity 
of  0(|  Vj  log  |V|)  assuming  the  number  of  crossings  per  node  is  bounded  by  a  small  con¬ 
stant.  Following  this,  the  MWIS  algorithm  is  run;  our  naive  implementation  of  this  is 
0(|Vj  •  | El),  but  can  be  improved  by  using  a  heap  implementation  of  a  priority  queue  with 
a  increase/decrease  key  functionality.  The  placement  stage  requires  running  the  network 
simplex  algorithm;  our  naive  implementation  has  a  time  complexity  of  0(|V|2  •  |.E|),  mak¬ 
ing  this  the  most  computationally  intensive  component  of  SVEN.  However,  even  with  an 
overall  cubic  worst  case  complexity,  the  layout  can  be  computed  under  interactive  con¬ 
ditions  (i.e. ,  latency  <  500ms)  for  thousands  of  nodes  and  edges.  We  would  expect  the 
visualization  to  become  cluttered  and  unusable  long  before  the  layout  algorithm  starts  to 
running  too  slowly  from  the  user’s  perspective.  Therefore,  we  suggest  that  future  work 
should  prioritize  addressing  the  problem  of  scalability  through  effective  interaction  and 
abstraction  techniques  over  fine  tuning  of  the  layout  algorithm. 

Common  ways  of  addressing  scalability  issues  are  through  filtering  and  abstraction.  For 
example,  filtering  can  be  used  to  reduce  the  number  of  nodes  and  edges  to  be  shown,  which 
in  turn  makes  the  layout  easier  to  compute  and  potentially  less  cluttered.  Filtering  can  be 
automatic  by  using  degree  of  interest  (DOI)  techniques  to  assign  a  score  to  data  elements, 
and  visualizing  a  relevant  subset  of  this  data  [53].  For  networks,  abstraction  is  frequently 
accomplished  through  clustering  eh  El  in  ESI  m,  where  groups  of  nodes  are  aggregated 
together,  and  represented  as  a  super  node.  In  the  visualization,  more  emphasis  is  placed 

3In  the  analysis  of  SVEN’s  time  complexity,  we  assume  the  number  of  time  windows  is  a  small  constant, 
which  is  bounded  more  by  the  available  screen  space  than  the  data. 
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on  the  clusters  and  the  inter-cluster  links,  because  clusters  are  assumed  to  be  dense,  and 
intra-cluster  links  are  less  informative.  This  technique  can  be  adapted  to  dynamic  networks 
by  leveraging  dynamic  network  clustering  techniques.  A  simple  approach  is  to  compute  a 
hierarchical  clustering  directly  on  the  aggregate  graph,  and  to  use  dendrogram  seriation  to 
compute  the  ordering  of  the  nodes,  allowing  groups  to  be  drawn  directly  on  the  visualization 
in  a  consistent  manner. 
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